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Background: Diagnostic DNA arrays for detection of point mutations as markers for cancer 
usually function in the presence of a large excess of wild type DNA. This excess can give rise to 
false positives due to competitive hybridization of the wild type target at the mutation spot. The 
analysis of the DNA array data is typically qualitative aiming to establish the presence or absence 
of a particular point mutation. Our theoretical approach yields methods for quantifying the analysis 
so as to obtain the ratio of concentrations of mutated and wild type DNA. 

Method: The theory is formulated in terms of the hybridization isotherms relating the hybridiza- 
tion fraction at the spot to the composition of the sample solutions at thermodynamic equilibrium. 
It focuses on samples containing an excess of single stranded DNA and on DNA arrays with low 
surface density of probes. The hybridization equilibrium constants can be obtained by the nearest 
neighbor method. 

Results: Two approaches allow us to obtain quantitative results from the DNA array data. In 
one the signal of the mutation spot is compared with that of the wild type spot. The implementation 
requires knowledge of the saturation intensity of the two spots. The second approach requires com- 
parison of the intensity of the mutation spot at two different temperatures. In this case knowledge 
of the saturation signal is not always necessary. 

Conclusions: DNA arrays can be used to obtain quantitative results on the concentration ratio 
of mutated DNA to wild type DNA in studies of somatic point mutations. 



I. INTRODUCTION 

Cancers are attributed to accumulation of somatic mutations In turn, the mutated DNA can 
provide molecular markers of diagnostic utility Q- Among them are point mutations, involving a 
change in a single base pair in the DNA, such as those occurring in the _p53 and K-ras genes 0,01 HI- 
The detection of such point mutations is useful in screening for cancers ja as well as typing the cancer 
in order to optimize the treatment protocol • DNA microarrays 0,|9| > " DNA chips" , are among 
the analytical techniques of proven potential to this end [|| ^j. The detection of somatic point 
mutations is hampered by the presence of an excess of wild type DNA. This favors hybridization of 
the wild type single stranded DNA (ssDNA) with mismatched sequences resulting in false positives. 
A similar problem occurs in analyzing single nucleotide polymorphism of pooled samples 0. In 
the following we present a theoretical analysis of the errors introduced by such mishybridization 
utilizing equilibrium thermodynamics. It suggests methods of quantifying the detection of point 
mutations by DNA chips. In particular, these methods allow to obtain the ratio of concentrations 
of mutated and wild type DNA in the sample. This ratio is of diagnostic interest and it provides a 
systematic method for the minimization of false positives. The numerical calculations illustrating 
this approach are based on recent DNA chip studies of point mutation in the K-ras gene 0, ^| . 

DNA chips consist of a support surface carrying " spots" H, Q . Each spot comprises numerous 
oligonucleotides of identical and known sequence, "probes", that are terminally anchored to the 
surface. The spots are placed in a checkered pattern such that each sequence is allocated a unique 
site. Each probe hybridizes preferentially with a ssDNA containing a complementary sequence 
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referred to as "target". In a typical experiment the DNA microarray is immersed in a solution 
containing a mixture of labelled ssDNA fragments of unknown sequence. The presence of a given 
sequence is signalled by the hybridization on the corresponding spot as monitored by correlating 
the strength of the label signal with the position of the spot. False positives can occur because 
each probe can also hybridize with a mismatched sequence. When the DNA microarray is designed 
to investi gate gene expression it is possible to optimize the probe design in order to minimize this 
effect [Til 112 • However, the implementation of this strategy is more difficult for studies of point 
mutations. When studying somatic point mutations the situation is further complicated by an 
excess of wild type, non-mutated, DNA. Two observations substantiate this point. First, solid 
tumors are heterogenous, containing a mixture both cancerous and normal, stromal, cells. The 
cancer cells are a minority and the fraction of mutated DNA obtained from homogenized tumor 
biopsy can be as small as 15% Q. The fraction is much smaller for noninvasive testing for early 
stage cancers using body fluids such as urine [5j , serum |J| or stool [3|, |6j . Since the hybridization is 
controlled by the mass action law, the excess of wild type sequence will typically contribute to the 
hybridization on the spots allocated to the point mutations. Accordingly, the ratio of intensities 
of different spots may not reflect the ratio of concentrations of DNA species in the sample. This 
mishybridization contribution will increase with the ratio of wild type DNA to mutated DNA 
thus diminishing the efficacy of the early stage screening. A similar situation is encountered in 
the analysis of pooled single nucleotide polymorphism samples. This problem can be resolved by 
determining the contribution of the wild type DNA to the signal of the mutation spots. As we 
shall discuss, this is possible when three conditions are fulfilled: (i) the hybridization is allowed to 
reach equilibrium, (ii) The equilibrium hybridization isotherm, relating the hybridization fraction 
on the spot to the sample composition, is of the Langmuir form i.e., x/(l — x) = Kc where x is the 
fraction of surface sites that bind a reactant of concentration c and K is the equilibrium constant 
of the reaction 0]. This regime is expected when the surface density of probes is sufficiently 
low 0]. (iii) The sample contains an excess of ssDNA as is the case when using asymmetric PCR 
amplification JlH fl6| or following digestion by lambda exonuclease . Under these conditions 
the fraction of correctly hybridized probes on a spot is obtainable yielding also the concentrations 
of mutated and wild type DNA in the sample. Two approaches are possible: (a) comparing the 
signals of two spots with probes that match, respectively, the wild type and mutated sequences, 
(b) Comparing the signals of the spot corresponding to the mutation of interest at two different 
temperatures, T\ and T^,. The first approach requires measurement of the saturation signal of the 
two spots while for the second this step may be eliminated. Importantly, the experimental set up 
reported by Fotin et al satisfies the three conditions listed above and allows to implement the 
two temperature approach. 

Our analysis is based on the hybridization isotherms of DNA chips allowing for the role of two 
types of competitive hybridization 0[ . Competitive surface hybridization occurs when two different 
targets can hybridize with the same probe. Competitive bulk hybridization takes place when the 
target can hybridize with a complementary sequence in the bulk. The second process is dominant 
when the samples are produced by symmetric PCR amplification. When asymmetric PCR [l5llT^| 
is used the sample contains an excess of ssDNA which does not experience the effect of competitive 
bulk hybridization. This situation can also be obtained by digesting the product of symmetric 
PCR with lambda exonuclease ^?|]- Under these conditions the excess of ssDNA dominates the 
hybridization with the probes and competitive surface hybridization becomes the major source of 
error. In the general case the hybridization isotherm reflects the electrostatic penalty incurred 
because each hybridization event increases the charge of the probe layer [Til Il9| . We will focus on 
systems where these interactions are screened and the hybridization isotherm assumes the Langmuir 
form 0] which facilitates quantitative analysis of the data. The use of PCR amplification renders 
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FIG. 1: Competitive Surface Hybridization. The m! probes at the mutation spot can hybridize with 
both the complementary m targets and the mismatched wild type w targets. The fraction of mishybridized, 
m'w, probes is high when the w targets are present in excess. Double stranded targets are not shown. 



the absolute concentrations of DNA species meaningless. Our analysis is thus concerned with the 
ratio c m /c w of the excess concentrations of mutated (c m ) and wild form (c w ) of ssDNA. 

Our discussion focuses on the situation following amplification by asymmetric PCR (Figure 1). 
It concerns two spots: a mutation spot carrying m! probes and a wild type spot carrying w' probes. 
The sample solution comprises of a wild type ssDNA fragment w and a mutated fragment m as 
well as their complementary fragments, w' and m! . The number of w and m fragments is however 
larger. We assume that all free w' and m! fragments hybridize with the free complementary w and 
m ssDNA fragments. Our analysis focuses on the excess of unhybridized m and w fragments whose 
concentrations are denoted respectively by c m and c w . We consider the small spot limit, when the 
hybridization at the spot has a negligible effect on the bulk concentrations. In this limit the double 
stranded DNA ww' and mm' pairs in the bulk do not affect the hybridization equilibrium at the 
spot. In other words, the hybridization isotherm relates c m and c w to the measured hybridization 
signal of a spot. The targets are actually much larger than the probes. In the K-ras studies we 
cite the target is comprised of 117 [l(j to 157 |(| monomers (nucleotides) while the probes comprise 
of 13-14 monomers. As we shall discuss, this "size asymmetry" mostly affects the boundary of 
the Langmuir regime. Its effect on the hybridization isotherms of spots of high probe density 
is beyond the scope of this article. In section II we present the necessary background on the 
hybridization isotherm when competitive surface hybridization is dominant. The following two 
section discuss respectively the two-spot and two-temperature approaches for the determination of 
Cm/cw Experimental considerations, the relation to existing experimental studies as well as open 
questions are presented in the Discussion. 
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II. THE ADSORPTION ISOTHERM 



The hybridization isotherm relates the equilibrium fraction of hybridized probes on the spot to 
the composition of the sample |l4j . A simple derivation of this isotherm is possible upon equating 
the rates of hybridization and denaturation at the surface of the spot [13]. In the following we 
consider the isotherm obtained when competitive surface hybridization is important. In particular, 
both the wild type sequence, u>, and the mutated one, m, can hybridize at the mutated m! spot. 
Consider a spot carrying a total of Nx m'-probes in a sample containing unhybridized m and w 
targets of concentrations c m and c w moles per liter respectively. The total hybridization reflects 
both hybridization of the perfectly matched m targets and of the mismatched w ones. The fractions 
of m'm and m'w hybridized probes are denoted respectively by x and y. The rate of denaturation 
of mismatched m'w probes is kd m yNr while the rate of hybridization of ml probes with w targets is 
ki im c w (\ — x — y)Nx where kdm and khm are the corresponding rate constants. When electrostatic 
interactions within the layer are strongly screened, the rate constants do not depend on the probe 
density and the fraction of hybridized probes. At equilibrium the two rates are equal, kd m y^T — 
kh m c w (l - x - y)N T leading to 

J = c w ^^- = c w K m ' w = Cw exp(-AG^, tl) /J?T). (1) 



(1 - x - y) k d 

where khm/ kdm = K m ' W is the equilibrium constant [2(j. Similarly, the two rates for the perfectly 
matched m'm probes are kd P xNx and kh P c m (\ — x — y)NT- Their equality at equilibrium, kd P xNx = 
khpC m (l — x — y)Nx leads to 

ku 

7^ = c m K m , m = c m e X p(-AG° m , m /RT), (2) 



(1 - x — y) k dp 

Here K m > w and K m > m are respectively the equilibrium constants of hybridization between m' probes 
and w and m targets, AG^ m and AG® n , m are the corresponding standard Gibbs free energies, T 
is the absolute temperature and R is t he g as constant. This kinetic derivation recovers the results 
of a rigorous thermodynamic analysis |14l | with one caveat: The molar concentrations should be 
replaced by dimcnsionlcss activities = jCi where 7 is the activity coefficient 20] . Since 7 -» 1 
as Ci — y we will retain expressions used above noting that the Ci are dimensionless having the 
numerical value of the molar concentrations of the i th species. The two isotherms, Q and are 
not helpful because PCR amplification of a given sample does not allow for different labels of the 
m and w targets. Since both are labeled identically, the measurement yields the total hybridization 
fraction 8 = x + y rather than x and y separately. Accordingly, the observed isotherm for the m' 
spot is 

^rn' ~j 7 — K'm'mC'm ~l~ Km'wCw (3) 
1 — f m ' 

as obtained by summing Q and @ . The fraction of mishybridized probes on the m' spot is 

jj _ ym' _ K m 'wCw i ,\ 

= 0~ = K / c +K < c ' ^ ' 

Here, and in the following, the subscript i — m',w' of f2j, Pj etc. identifies the spot. P m i = 1/2 
when c w = c m K m > m / 'K m i w and P m i > 1/2 for c w > CmK m > m / 'K m > w . In the case of interest, when 
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i'j AH^jkcal/mole AS^cal/mole 
mm -99.40 -264.29 
m'w -78.20 -214.12 
w'w -100.80 -270.55 
w'm -93.50 -253.59 

TABLE I: The thermodynamic parameters utilized in the numerical calculations correspond to the Alanine 
12 and wild type probes utilized by Prix et al @: m' = AGCTGCTGGCGTA, m = TCGACGACCGCAT, 
w' = CTGGTGGCGTAGG, w = GACCACCGCATCC as calculated from the nearest neighbor 
model fgj] for lMNaCl. Since the targets are longer than the probes two dangling ends are invoked. 



■deg AG°,j. (37° C)kcal/ mole K, lj(37 o C) 
-17.43 2.11 x 10 12 

-11.79 2.19 x 10 s 

-16.89 8.75 x 10 11 

-14.85 3.17 x 10 10 



c w ^ c m , P m > of the m' spot is large while P w / of the w' spot 

w ' ~ T~, ~ K , c +K , c [ ' 

is always small. The values of P m > (Figure 2) for a typical situation (Table 1) confirm that compet- 
itive surface hybridization is important only when the competing species is present in large excess. 
As noted earlier, this is the case in studies of somatic point mutations when the wild type ssDNA 
is a majority component. The wild type ssDNA then contributes to the hybridization on all the 
mutation spots. In contrast, the concentrations of the different mutated ssDNA are much smaller. 
As a result their contribution to the hybridization on the m! spot, and on other mutation spots, is 
negligible. This observation justifies limiting the analysis to the competition between two species, 
m and w. 

When competitive hybridization is negligible, equation J2J reduces to fl rn i = K m i m c m and the 
corresponding isotherm for the wild type is £l w i = K w i w c w . In such cases c m /c w can be determined 
from 
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This situation can be realized in gene expression studies with proper probe design and in the absence 
of overwhelming excess of one target species. Note that 9 of the mutation spot can be tuned to be 
low, 6 <C 1 by adjusting the hybridization temperature. In such regimes one may invoke a "weak 
spot" approximation 

fit ~ 0i (7) 

As we shall discuss, when applicable this simplifies the analysis of the two temperature experiments. 
Finally, in most cases the signal of the spot i at equilibrium, Ii is proportional to OiNx and 

0i = — (8) 

1 rmax \ / 



where J™ ax is the saturated signal of the i spot as obtained upon equilibration with a concen- 
trated solution of the perfectly matched target. It is important to note that Jf 113 * can depend on 
temperature. 

The melting temperatures of double stranded DNA are used as design criterion for probes 0, 
12]. It is thus useful to note that competitive surface hybridization affects the effective melting 
temperature, T^/. Tm is defined by the condition di = 1/2 or f2j = 1. In the general case Tm 
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FIG. 2: The fraction of mishybridized probes on the mutation spot as function of the concen- 
tration of the wild type target. P m / vs. c w curves calculated at T = 47°C for (1) c m = 1CT 9 M, (2) 
Cm = l(r 10 M, (3) c m = 10 _11 M and (4) c m = 0. 



depends on both c m and c w i.e., Tm — TM{c m ,c w ). Note that there are two targets that can 
hybridize with the same probe. This situation, involving three ssDNA species and two different 
dsDNA, is different from the one invoked in the definition of the melting temperature of a dsDNA. 
In this last case two complementary ssDNA hybridize to form one type of dsDNA and the total 
number of species is three rather than five. Furthermore, we are considering the small spot limit 
where the hybridization with the probes has negligible effect on the bulk composition. In the 
absence of competitive surface hybridization Tm reduces to its familiar forms: for c w = 0, this leads 
) = AH° m ,J{AS m , m +R\nc m ) while for c m = it leads to T m , w {c w ) = AH^, W /(AS^, W + 
R\nc w ). It is not possible to obtain an explicit analytical expression for TM(c m ,c w ). However, in 
the regime of interest, c m /c w <C 1 the melting temperature is well approximated by the first two 
terms of the Taylor expansion in c m around c m = i.e., Tm(c to , c w ) ~ T m i w + c m ^ M -| Cm= o thus 
specifying the melting temperature at the mutation spot 

Tin \ rp ( n \ i „ Km'm{T m 'w) r>rp2 (q\ 

-*-M\Cm>Cw) 1 rn'w \yw ) ' c m \ \ ttO I m'w V* ) 

I m'w I 

Importantly, in this regime Tm increases steeply with c m (Figure 3) and its initial value, for c m = 
is T m i w rather than T m i m . In marked distinction, for c m /c w > 1 the effect of the competitive 
surface hybridization is negligible and Tm ~ T m i m (c m ) with a weak logarithmic dependence on c m . 

When the probe density is relatively high and the salt concentration is sufficiently low, it is 
necessary to allow for the electrostatic penalty incurred upon hybridization because of the extra 
charge deposited at the probe layer. In this regime the hybridization isotherms assume a different 
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FIG. 3: The dependence of the melting temperature of the mutation spot on the concentration 
of the mutation target. Tm(c w , c m ) in °C is plotted vs. c m /c m for various c m + c w = c: (1) c = 10 _6 M 
(2) c = 10 _7 M and (3) c = 10 _8 M. Dots correspond to the melting temperatures T m r w (c w ). 



form exemplified by 

6 i 

^m' = i — ^ — = (^m'mC m + ]f m ' ffi c m ) exp [-IV (1 + 9 m >)] . (10) 
1 — V m i 

For the case of probes and targets of equal and short length T m > = constant xcr m > where <r m > is the 
charge density of the unhybridized m! spot and the constant is set by the ionic strength and the 
length of the probe [l4|- In the following we consider the opposite limit, when the electrostatics 
interactions are screened and the chains do not interact. The hybridization is then described by a 
simple Langmuir isotherm that is, the case of Ti ~ 0. Importantly, in studies of point mutations 
the targets are typically longer than the probes. Usually, the hybridization site is situated roughly 
in the middle of the target. Accordingly, a hybridized probe incorporates two unhybridized target 
sections of similar length. The span of such segment is Rq rs (nal) 1 ^ 2 where n is the number of 
bases in the section, a is the base size and I is the persistence length. Typical values are a w O.Gnm 
and I w 0.75nm |2l| . The Lan gmuir regime occurs when the area per probe, S, exceeds Rq thus 
ensuring that the hybridized probes do not interact. For the target-probe pairs considered this 
condition is satisfied when E > 65nm 2 or less than 1.5 x 10 4 grafted probes per /im 2 . Operation 
in this range allows us to benefit from the absence of the non-linear behavior introduced by the 
exp [-r m /(l + 6 m >)] term. 

The hybridization isotherm is only applicable at equilibrium. In turn, this implies two conditions: 
stationary signal and path independence i.e., independence of the preparation method and sample 
history. The reported times required to attain stationary signal vary between minutes jl8| to 14 
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hours |22L |23| . Importantly, the equilibration time may depend on the probe density hybridization 
conditions, the equilibrium 9i etc. Path independence requires that the measured signal at equi- 
librium will not change with the thermal history (heating and cooling cycles). As stated earlier, 
these conditions are satisfied by the system of Fotin et al ■ Under conditions of thermodynamic 
equilibrium the quantitative methods of analysis we describe below are applicable. However, these 
methods require knowledge of 9i and 0$. In turn, to obtain 9i and fit it is necessary to ascertain 
the saturation value of the signal of the i spot, J™ ax or, equivalently, the Nt of the i spot. 

Numerical implementation of the analysis we describe requires knowledge of the equilibrium 
constants K m i m and K m i w as well as K w i m and K w i w . These can be best obtained experimentally 
from the hybridization isotherm of a spot in contact with single component samples. However, 
this approach is time consuming when a number of equilibrium constants is required. Fortunately, 
in the Langmuir regime it is reasonable to approximate the equilibrium constants by their bulk 
values. The nearest neighbor model, with the unified set of parameters compiled by Santa Lucia 
and collaborators [24l 1251 |2(j , allows to calculate AH and AS* for the hybridization of perfectly 
matched oligonucleotide pairs as well as for pairs containing a single mismatch. In the numerical 
calculations we use probe target pairs incorporating the 12 and 13 th K-ras codons. The AH 
and AS values are obtained by use of HYTHERTM software implementing the nearest neighbor 
approach 27]. The identity of the probes and targets considered in the numerical calculations 
as well as the corresponding standard enthalpies, entropies are listed in Table 1. The table also 
lists the free energies of formation and the equilibrium constants of hybridization at 37°C. The 
hybridization temperatures considered are chosen in view of the conditions in the cited experiments. 
Fotin et al studied the hybridization at the range of T = -20°C to T = 60°C. In the K-ras 
studies the hybridization temperatures were 37°C and 50°C ^(J- The c m values in the numerical 
examples vary around 3 • 10~ 10 M, while the concentration of the control target c w is 10 2 to 10 3 
larger 0- 



III. THE TWO SPOT APPROACH 



When two spots carrying respectively wild type, w' , and mutation, mf, probes are placed in 
contact with a solution of w and m targets, the corresponding isotherms are 

Q w ' — c w K W ' W -t- c Ta K w ' ra (1 1 ) 

£7 m / c w K m ' w -\- CniKfji' 7n (1^) 

where fl; = 9i/ (1 — 9i) of spot i = w', m! and 9i is the corresponding fraction of hybridized probes, 
irrespective of their identity (mismatched or perfectly matched) . These two equations immediately 
determine c w and c m 

Cw = ~K K ~ K K ^ ' 



Accordingly 



(15) 
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FIG. 4: Melting curves of the mutation spot. m i vs. temperature curves are plotted for c m + c w = 

c = 1Q- 7 M and (1) c m /c w = 0, (2) c m /c w = 1CT 4 , (3) c m /c w = 1CT 3 and (4) c m /c w = 1CT 2 . For a given 
c w ~ c the melting temperature and the saturation temperature increase with c m /c w . 



where, in the regime considered, a = fl w //tt m > 3> 1. The range of a values varies between a max = 
K w i w J K m i w 3> 1, corresponding to c. m = and a min — K w i m j K m > m <C 1 when c w = 0. In the 
realistic limit of aK m i w <C K w i w this expression reduces to c m /c w w K w > w /aK m i m that is 

l '7n **-w'w ± ra' ± w ' ± w' i A a\ 

where I m > and I w > are the measured intensities of the m' and w' spots while /™? x and /™ ax are the 
corresponding saturation values. Thus, the implementation of this approach requires knowledge 
of Nt, or equivalently I max , for the two spots. In the general case it is necessary to know four 
equilibrium constants while in the simplest case, of aK m i w K w i w , knowledge of K m i m and K w i w 
is sufficient. 

The two spots approach is feasible when the values of 6 m > and 6 W > for different sets of c m and c w 
are distinguishable. In practical terms this imposes two requirements. First, it is necessary to avoid 
the saturation regimes of the melting curve (Figure 4) and the hybridization isotherm (Figure 5). 
Second, the values corresponding to the sample composition must be large enough in comparison 
with the experimental errors. One may optimize the performance by tuning the hybridization 
temperature T: Increasing T lowers the hybridization degree thus preventing saturation at the 
price of weaker and a higher noise to signal ratio. 

The operational conditions are thus determined by two inputs: The typical c w of the amplicon 
and the desired range of c m /c w . Once these two parameters are specified it is possible to calcu- 
late the relevant melting curves and hybridization isotherms in order to choose the hybridization 
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FIG. 5: The hybridization isotherm of the mutation spot. 9 m i vs. c m curves are plotted at (a) 
T = 37°C (b)T = 47°C for (1) c w = 10" 7 M (2) c w = 10~ S M (3) c w = 10" 9 M and (4) c w = 0. The 
saturation regime can be avoided by increasing the hybridization temperature. 



temperature. As noted before (Eg. I15fl . c m /c w is extracted from a — Q w >/Q, m i. The temperature 
dependence of a is relatively weak (Figure 6). The accessible range of a depends however on the 
detection limit of 9 m i which sets a lower bound of 6 m i, 9 m i(min). Since Q w i < 1 the maximal range 
of a is roughly l/0 m /(mm). 
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FIG. 6: The ratios of concentrations obtainable from a "2-spot" experiment. c m /c w vs. Q, w i /Cl m i 
curves are plotted at (1) T = 37°C and (2) T = 47°C in the range 0.05% < c m /c w < 1%. Inset: Range 
1% < c m /c w < 10%. 



IV. THE TWO TEMPERATURE APPROACH 



An alternative approach involves equilibration of the DNA chip with the biological sample at two 
different temperatures, T\ and T 2 . Focusing on the m' spot, we have 

fi m '(Ti) = c w K m ' w (Ti) + c m K m ' m (Ti), (17) 

fim'(22) = c w K m > w (T2) + c m K m > m (T2). (18) 
Solving equations ((T7|l and (fT5|) leads to 

Qm'{T2)K m i m (Ti) — Q m '{Ti)K m > m {T2) 



K m ' W (T 2 )K m i m (Ti) - K m / w (Ti)K m i m {T2) 



and 

Cm _ Qm>(Tl)K m > w (T 2 ) - Q m >{T2)K m > w (Ti) 



ra'm 



(19) 



c _ Q m '(Ti)K m / w (T2) — Q m '{T2)K rn i w (Ti) 
K m i w (T2)K m i m {Ti) - K m i w {Ti)K m > m (T2) 



(21) 
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FIG. 7: Effect of the concentration ratio on the temperature dependence of fi TO >(Ti)/f2 m /(T2). 
A = n m /(Ti)/fi m /(T 2 ) is plotted vs. AT for the interval 10 < AT < 15 for T\ = 37°C. The concentration 
ratios are (1) c m /c w = 10" 2 (2) c m /c w = 10~ 3 (3) c m /c w = 10" 4 and (4) c m /c w = 0. 



To simplify this equation it is convenient to express AJ ^^ m as A ^^''" (1 — y^Et ) wnere ^2 
7i + AT. Upon defining A = fim'p2)/r2 m '(Ti) we obtain c m /c w in the form 



/ AH m , w AT \ 

Cm _ K rn , m (Ti) exp V « T i t 1+ atJ 



-^m^(Tl) A _ exrj f AH m , m AT \ 
exp ^ RTi Tl+AT y 



(22) 



AT 



Equation l|22|) can be simplified further when AT/Ti <C 1 thus allowing to approximate Tl+Ar ~ 
^r-. In the weak spot limit A = j"^[ti) T"° x (T2) an< ^ saturation signals cancel when I™?*{T) is 
independent of T. When /™, ax (T) does v ary with T it is possible to eliminate this contribution by 
using an appropriate calibration method 18j. Hence, the measurement of the saturation values can 
be eliminated in the weak spot regime. This is of interest when the measurement technique allows 
to study the 9i -C 1 range. 

As we discussed, the optimization of the two spot approach is achieved by tuning a single 
hybridization temperature. In the two temperature approach it is attained by choosing T± and 
AT = T2 — T\. The general requirements are the same: Avoiding the saturation regimes on one 
hand and the high noise to signal ratios on the other. Two additional observations merit comments. 
First, increasing AT magnifies the range of A corresponding to the c m /c w range of interest (Figure 
7). Second, it also allows to avoid the divergence regions in the c m /c w vs. A curves (Figure 8) 
where ultra precise values of A are required. 
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FIG. 8: Ratio of concentrations obtainable from a two temperature experiment. c m /c w vs. 
n m /(Ti)/n m /(T 2 ) for Ti = 37° C and (1) T 2 = 47°C, (2) T 2 = 49°C and (3) T 2 = 51° C. 

V. DISCUSSION 

Studies of somatic point mutations inevitably concern samples with a large excess of wild type 
DNA. This excess is propagated by the PCR amplification unless peptide nucleic acid (PNA) 
clamps [2^ are used. The utilization of DNA chips to characterize the composition of such samples 
is hampered by competitive surface hybridization due to the pairing of the wild type target with the 
mutation probe. This mishybridization contributes significantly to the intensity of the mutation spot 
signal and thus gives rise to false positives. Our theoretical analysis suggests a systematic approach 
to the minimization of such errors. At best one can extract c m /c w thus obtaining additional clinical 
data. At least, this method provides a rational approach for devising criteria for identification of 
false positives. 

DNA chips designed to detect point mutation in solid tumors were developed by Lopez-Crapez et 
al [To) . The sample preparation in this case involves asymmetric PCR or symmetric PCR followed 
by digestion with lambda exonuclease. DNA chips of higher sensitivity for the detection of K-ras 
mutation in stool were reported recently by Prix et al |6( . The amplification method utilizes PNA- 
mediated PCR clamping [2£|. The PNA binds to the wild type codons of interest thus inhibiting 
their amplification. The PNA binding to the mutated DNA is weaker and does not inhibit its 
amplification. The overall result is a selective amplification of the mutated DNA. The methods 
we propose can be used to improve the performance of the approach of Lopez-Crapez et al. In 
this case the c m /c w values may provide supplementary information of diagnostic value. In marked 
contrast, our methods are of little use within the approach of Prix et al, which relies on preferential 
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amplification of the mutated DNA thus avoiding problems due to excess of wild type DNA. 

Our analysis suggested two methods to obtain c m /c w . The advantage of the two-spot approach is 
that it does not involve a change in T. This approach requires however knowledge of the saturation 
value of the two spots. Obtaining the saturation values can be time consuming. However, for chips 
designed for multiple use this step can be performed only once provided the saturation value does 
not change with the hybridization regeneration cycles. The two-temperature approach is attractive 
when using a label free detection scheme such as surface plasmon resonance. In this case the two 
measurements can be carried out with no washing steps. Importantly, Fotin et al [Ts| demonstrated 
the feasibility of studying the temperature dependence of hybridization isotherms even for targets 
labelled with fluorescent tags. 

Thermodynamic equilibrium is a necessary condition for the implementation of the quantification 
methods proposed in this article. The equilibration times of DNA chips and their variation with 
the hybridization conditions are not yet fully understood. The reported equilibration times for 
hybridization on DNA chips vary widely. In comparing the results it is important to note differences 
in hybridization conditions, length of probes, type of chip and detection techniques. Two studies 
are of special interest. Peterson et al reported equilibration times of up to 14 hours [22l |23|. In 
marked contrast, the results of Fotin et al |l8j| suggest equilibration within minutes. In the present 
context, the work of Fotin et al is of special interest since it reports equilibrium hybridization 
isotherms at different temperatures. Importantly, the isotherms indeed satisfy the conditions of 
equilibrium: Stationary state and lack of hysteresis. Furthermore, the system exhibits a Langmuir 
type hybridization isotherm and the thermodynamic parameters extracted from the temperature 
dependence of the hybridization isotherms are in good agreement with the reported bulk values. 
Note however that the hybridization constants describing other types of DNA chips do not always 
exhibit such agreement |29t 13(3 ] . 

The proposed methods require DNA chips obeying Langmuir type hybridization isotherms. In 
turn, this requires spots with relatively low density of probes. Clearly, there are advantages to 
chips carrying spots with high density of probe. One is that high density enable smaller spots thus 
allowing for greater number of different spots. Importantly, in DNA microarrays the high probe 
density gives rise to an electrostatic modification of the isotherms. The resulting nonlinearities are 
undesirable for obtaining quantitative results. 



Acknowledgement 

The authors benefitted from insightful discussions with T. Livache. The work of EBZ was funded 
by the CNRS. 



[1] Lodish H, Berk A, Zipursky L, Matsudaira P, Baltimore D, Darnell J. Molecular Cell Biology. WH 
Freeman 4th Ed 2000. 

[2] Kirk BW, Feinsod M, Favis R, Kliman RM, Barany F. Single nucleotide polymorphism seeking long 
term association with complex disease. Nucleic Acids Res 2002; 30:3295-3311. 

[3] Sidransky D, Tokino T, Hamilton SR, Kinzler KW, Levin B, Frost P, et al. Identification of ras oncogene 
mutations in the stool of patients with curable colorectal tumors. Science 1992; 256:102-105. 

[4] Hibi K, Robinson CR, Booker S, Wu L, Hamilton SR, Sidransky D, Jen J. Molecular detection of 
genetic alternations in the serum of colorectal cancer patients. Cancer Res 1998; 58:1405-1407. 



15 



[5] Sanchez-Carbayo M. Use of high-throughput DNA microarrays to identify biomarkers for bladder 

cancer. Clin Chem 2003; 49:23-31. 
[6] Prix L, Uciechowski P, Bockmann B, Giesing M, Schuetz AJ. Diagnostic biochip array for fast and 

sensitive detection of K-ras mutations in stool. Clin Chem 2002; 48:428-435. 
[7] Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, et al. Molecular classification 

of cancer: class discovery and prediction by gene expression monitoring. Science 1999; 286:531-537. 
[8] Graves DJ. Powerful tools for genetic analysis come of age. Trends Biotechnol 1999; 17:127-134. 
[9] Wang J. From DNA biosensors to gene chips. Nucleic Acids Res 2000; 28:3011-3016. 
[10] Lopez-Crapez E, Livache T, Marchand J, Grenier J. K-ras mutation detection by hybridization to a 

polypyrrole DNA chip. Clin Chem 2001; 47:186-194. 
[11] Lockhart DJ, Dong H, Byrne MC, Follettie MT, Gallo MV, Chee MS, et al. Expression monitoring by 

hybridization to high density oligonucleotide arrays. Nature Biotechnol 1996; 14:1675-1680. 
[12] Li F, Stormo GD. Selection of optimal DNA oligos for gene expression arrays. Bioinformatics 2001; 
17:1067-1076. 

[13] Adamson AW. Physical Chemistry of Surfaces. J Wiley, New York, 4th Ed, 1982. 

[14] Halperin A, Buhot A, Zhulina EB. Sensitivity, specificity and the hybridization isotherms of DNA 

chips. Biophys J 2004; 86:718-730. 
[15] Gyllensten UB, Erlich HA. Generation of single stranded DNA by the polymerase chain reaction and 

its application to the direct sequencing of the HLA-DQA locus. Proc Natl Acad Sci USA 1988; 85:7652- 

7656. 

[16] Gao HF, Tao SC, Wang D, Zhang C, Ma XM, Cheng J, et al. Comparison of different methods for 
preparing single stranded DNA for oligonucleotide microarray. Analytical Lett 2003; 36:2849-2863. 

[17] Higuchi RG, Ochman H. Production of single stranded DNA templates by exonuclease digestion fol- 
lowing the polymerase chain reaction. Nucleic Acids Res 1989; 17:5865. 

[18] Fotin AV, Drobeyshev AL, Proudnikov DY, Perov AN, Mirzabekov AD. Parallel thermodynamic anal- 
ysis of duplexes on oligodeoxyribonucleotide microchip. Nucleic Acids Res 1998; 26:1515-1521. 

[19] Vainrub A, Pettitt MB. Coulomb blockage of hybridization in two-dimensional DNA arrays. Phys Rev 
E 2002; 66:041905. 

[20] Moore WJ. Physical Chemistry. Longman, London 1972. 

[21] Smith SB, Cui Y, Bustamante C. Overstretching B-DNA: the elastic response of individual double- 
stranded and single-stranded DNA molecules. Science 1996; 271:795-799. 

[22] Peterson AW, Heaton, RJ, Georgiadis RM. The effect of surface probe density on DNA hybridization. 
Nucleic Acids Res 2001; 29:5163-8. 

[23] Peterson AW, Wolf LK, Georgiadis RM. Hybridization of mismatched or partially matched DNA at 
surfaces. J Am Chem Soc 2002; 124:14601-7. 

[24] SantaLucia J Jr. A unified view of polymer, dumbbell and oligonucleotide DNA nearest neighbor 
thermodynamics. Proc Natl Acad Sci USA 1998; 95:1460-5. 

[25] Peyret N, Senerviratne PA, Allawi HT, SantaLucia J Jr. Nearest-neighbor thermodynamics and NMR 
of DNA sequences with internal A-A, C-C, GG and T-T mismatches. Biochemistry 1999; 38:3468-3477. 

[26] Bommarito S, Peyret N, SantaLucia J Jr. Thermodynamic parameters for DNA sequences with dangling 
ends. Nucleic Acids Res 2000; 28:1929-1934. 

[27] HYTHERTM version 1.0, Nicolas Peyret and John SantaLucia Jr. Wayne State University available 
at http://ozone2.chem.wayne.edu/Hyther/hythermlmain.html (Accessed April 2004). 

[28] Thiede C, Bayerdorffer E, Blasczyk R, Wittig B, Neubauer A. Simple and sensitive detection of muta- 
tions in ras proto-oncogenes using PNA-mediated clamping. Nucleic Acids Res 2001; 24:983-4. 

[29] Held GA, Grinstein G, Tu Y. Modeling of DNA microarray data by using physical properties of 
hybridization. Proc Natl Acad Sci USA 2003; 100:7575-7580. 

[30] Zhang L, Miles MF, Aldape, KD. A model of molecular interactions on short oligonucleotide microar- 
rays. Nature Biotechnol 2003; 21:818-821. 



